IN THE CLAIMS: 

^ 

Please amend claims 3 and 4 as follows: 



1 . (Original)fA\|Tiethod comprising: 
inputting/dpe^ch representing an utterance and having an intonation; and 

identifying kn endpoint of the utterance based on the intonation. 

2. (OriginalnA method as recited in claim 1 , wherein said identifying an endpoint 
of the utterance based on the intonation comprises comparing the intonation with 
an intonation model. 

3. (Currently amended) A method as recited in claim 4 L further comprising 
determining the intonation by computing the fundamental frequency of the 
utterance. 

4. (Currently amended) A method as recited in claim 3 1, wherein said 
determining the intonation comprises using an intonation model to determine the 
intonation. 



5. (Original) A method\as recited in claim 1 , wherein said identifying the 
endpoint of the utterance, comprises identifying the endpoint of the utterance 
based on a plurality of knowledge sources, wherein one of the knowledge 
sources is intonation, including referencing the input speech against a histogram 

each of 



based on training data for et 



the knowledge sources. 



. (Original) A method as recited in claim 1 , further comprising: 

determining a period of time that has elapsed since the speech dropped 

below a thresholol value; and 

wherein said, identifying an endpoint of the utterance comprises identifying 

the endpoint of the utterance further based on the period of time. 

7. (Original) Apjetnod as recited in claim 1 , wherein said identifying an endpoint 
of the utterance V)r)mrises identifying the endpoint of the utterance further based 
on a length of time fbrvvhicn an energy value of the speech has remained below 
a predetermined energy value. 

8. (Origmal) A method as recited in claim 7, wherein said identifying an endpoint 
of the utterance further comprises identifying the endpoint of the utterance based 
on the duration of the final syllable of the utterance. 

9. (Original! A method of operating an endpoint detector, the method 
comprising: I 

inputting speech representing an utterance, the utterance having an 
intonation; ana 

comparing the intonation of the utterance with an intonation model; 
determining a probability based on a result of said comparing; and 
identifying an endpoint of the utterance based on the probability. 
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10. (Original) A method as recited in claim 9, further comprising determining the 
intonation of the utterance as a function of the fundamental frequency of the 
utterance.! 



1 1 . (Original) A method as recited in claim 9, further comprising: 

determining a period of time that has elapsed since a value of the speech 
dropped below a threshold value; and 

wherein said identifying an endpoint of the utterance comprises identifying 
the endpoint of the utterance further based on the period of time. 

12. (Original) A method as recited in claim 9, wherein said identifying an 
endpoint of the utterance comprises identifying the endpoint of the utterance 
further based onlthe duration of the final syllable of the utterance. 

13. (Original) A method as recited in claim 12, wherein said identifying an 
endpoint of the utterance comprises identifying the endpoint of the utterance 
further based on alperiod of time for which an energy value of the speech has 
remained below a threshold value. 



14. (Original) A method of operating an endpoint detector for speech 
recognition, the method comprising: 

inputting speech representing an utterance; 

determining tha\ a value of the speech has dropped below a threshold 

value; 

computing an intonation of the utterance; 
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referencing the intonation of the utterance against an intonation model to 
determine a first end-of-utterance probability; 

determining a period of time that has elapsed since the value of the 
speech dropped below the threshold value; 

referencing the period of time against an elapsed time model to determine 
a second end-of-utterance probability; 

computing an overall end-of-utterance probability as a function of the first 
and second end-of-utterance probabilities; and 

determining whether an end-of-utterance has occurred based on the 
overall end-of-utterance probability. 

15. (Original) Aunethod as recited in claim 14, wherein said computing an 
intonation of the Utterance comprises computing an intonation of the utterance by 
determining the fundamental frequency of the utterance as a function of time. 

16. (Original) A method as recited in claim 15, further comprising: 

determining a duration of a final syllable of the utterance; and 
referencing the duration of the final syllable against a syllable duration 

model to determine a third end-of-utterance probability; 

wherein said computing an overall end-of-utterance probability comprises 

computing the overall end-cVutterance probability as a function of the first, 

second, and third end-of-utterance probabilities. 



1 7. (Original) A method of operating an endpoint detector for speech 
recognition, the methodlcomprising: 

inputting speech representing an utterance; 
computing an intonation of the utterance; 

referencing the intonation of the utterance against an intonation model to 
determine a first end-of-uperance probability; 

determining a duration of a final syllable of the utterance; 

referencing the duration of the final syllable against a syllable duration 
model to determine a second end-of-utterance probability; 

computing an overall end-of-utterance probability as a function of the first 
and second end-of-utterance probabilities; and 

determining whether an end-of-utterance has occurred based on the 
overall end-of-utterance probability. 

18. (Original) A method as recited in claim 17, wherein said computing an 
intonation of the utterance comprises computing an intonation of the utterance by 
determining the fundamental frequency of the utterance as a function of time. 



19. (Original) A method as recite^ in claim 17, further comprising: 

determining that a value of the speech has dropped below a threshold 

value; 

determining a period of time that has elapsed since the value of the 
speech dropped below the threshold lalue; and 



referencing the period of time against an elapsed time model to determine 
a second end-of-utterance probability; 

wherein said computing an overall end-of-utterance probability comprises 
computing the overall end-of-utterance probability as a function of the first, 
second, and third end-cf-utterance probabilities. 

20. (Original) A method of operating an endpoint detector for speech 
recognition, the method comprising: 

inputting speech representing an utterance, the utterance having a time- 
varying fundamental frequency; 

determining that a value of the speech has dropped below a threshold 

value; 

computing an intonatbn of the utterance by determining the fundamental 
frequency of the utterance as a function of time; 

referencing the intonation of the utterance against an intonation model to 
determine a first end-of-utterance probability; 

determining a period of tune that has elapsed since a value of the speech 
dropped below the threshold value; 

referencing the period of time against an elapsed time model to determine 
a second end-of-utterance probability; 

determining a duration of a jfinal syllable of the utterance; 
referencing the duration of the final syllable against a syllable duration 
model to determine a third end-of-utterance probability; 



-of-i^tte 
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computing an overall end-of-utterance probability as a function of the first, 
second, and third end-of-utterance probabilities; and 

determining whether an end-of-utterance has occurred by comparing the 
overall end-of-utterance probability to a threshold probability. 

21 . (Original)\ A method of operating an endpoint detector for speech 
recognition, theVnethod comprising: 

inputting speech representing an utterance; 

determining an intonation of the utterance; 

if the intonation\of the utterance is determined to be generally decreasing, 
then setting a threshold/tjme^period equal to a first time value; 

if the intonatiofiW thV utterance is determined not to be generally 
decreasing, then setting the threshold time period equal to a second time value 
larger than the first time value; and 

identifying an endpoint of the utterance based on the threshold time 

period. 



22. (Original) A method as recited in claim 21 , wherein said using the threshold 
time period to identify an endpoint of the utterance comprises using the threshold 
time period to identify an endpoint of the utterance by determining that an 
endpoint of the utterance has occurred if an energy value of the speech remains 
below a predetermined value for the threshold tim^ period. 



8 



23. (Original) A method as recited in claim 21 , wherein said determining an 
intonation of the utterance comprises using an intonation model. 



24. (Original) A method of operating an endpoint detector for speech 
recognition, the method comprising: 

inputting speech representing an utterance, the utterance having a time- 
varying fundamental frequency; 

determining an intonation of the utterance by 

computing the intonation as the fundamental frequency of the 
of time, ai|ici 

referencing theyintohalion against an intonation model to determine 



utterance as a function 



the intonation of the utterance; 

if the intonation of the utterance is determined to be generally decreasing, 
then setting a threshold time period eaual to a first time value; 

if the intonation of the utterance is determined not to be generally 
decreasing, then setting the threshold (ime period equal to a second time value 
larger than the first time value; and 

using the threshold time period td identify an endpoint of the utterance, by 
determining that an endpoint of the utterance has occurred if the speech remains 
below a predetermined value for a length Ipf time equal to the threshold time 
period. 
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25. (Original) A machinevreadable program storage medium tangibly embodying 
a sequence of instructionsWecutable by a machine to perform a method for 
endpoint detection, the metpod^oraprising: 

inputting speech presenting an utterance, the utterance having an 




intonation; and 

identifying an endpoint 6/ the utterance based on the intonation of the 
utterance. 

26. (Original) A machine-readable program storage medium as recited in claim 
25, wherein said using the intonation of the utterance in identifying an endpoint of 
the utteraVice comprises comparing the intonation of the utterance with an 
intonation model. 

27. (Original) A machine-readable program storage medium as recited in claim 
25, wherein t\ie method further comprises determining the intonation of the 
utterance. 

28. (Original) AVnachine-readable program storage medium as recited in claim 
27, wherein said determining the intonation of the utterance comprises 
computing the fundamental frequency of the utterance. 

29. (Original) A maclnine-readable program storage medium as recited in claim 
27, wherein said determining the intonation of the utterance comprises using an 
intonation model to determine the intonation of the utterance. 
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30. (Original) A machinevreadable program storage medium as recited in claim 
25, wherein the method further comprises: 

determining a period of time for which an energy value of the speech has 
been below a threshold value; and 

wherein said identifying an endpoint of the utterance comprises identifying 
the endpoint of the utterance! further based on the period of time. 



31 . (Original) A machine-readable program storage medium as recited in claim 



25, wherein the method furthei 



wherein said identifying 



comprises: 



determining a duration cf a final syllable of the utterance; and 



in endpoint of the utterance comprises identifying 



the endpoint of the utterance further based on the duration of a final syllable of 



the utterance. 



32. (Original) A machHne-readable program storage medium as recited in claim 
31, wherein the method fiurther comprises: 

determining a period of time that has elapsed since a value of the speech 
dropped below a threshof 
wherein said identifyir 



lue^-and 

Van endboint of the utterance comprises identifying 



the endpoint of the utterance further/based on the period of time. 



33. (Original) An endpoint detected comprising: 

means for inputting speech representing an utterance, the utterance 
having an intonation; and 



• 



means for identifying an endpoint of the utterance based on the intonation 
of the utterance 

34. (Original) An endpoint detector as recited in claim 33, wherein said means 
for using the intonation of the utterance in identifying an endpoint of the utterance 
comprisis means for comparing the intonation of the utterance with an intonation 
model. 

35. (Original) An endpoint detector as recited in claim 33, further comprising 
means for Determining the intonation of the utterance. 

36. (Original) An endpoint detector as recited in claim 35, wherein said means 
for determining the intonation of the utterance comprises means for computing 
the fundamental frequency of the utterance. 

37. (Original) An endpoint detector as recited in claim 35, wherein said means 
for determining theNintonation of the utterance comprises means for using an 
intonation model to determine the intonation of the utterance. 



38. (Original) An endpoint detector as recited in claim 33, further comprising: 

means for determinir/fgNj period of time that has elapsed since a value of 
the speech dropped below a%ffleshold value; and 

wherein said means for identifying an endpoint of the utterance comprises 
means for identifying the endpointpf the utterance further based on the period of 
time. 
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39. (Original)^ An endpoint detector as recited in claim 33, further comprising: 
means for determining a duration of a final syllable of the utterance; and 
wherein said means for identifying an endpoint of the utterance comprises 
means for identifying the endpoint of the utterance further based on the duration 
of a final syllable\of the utterance. 



40. (Original) An endpoint detector as recited in claim 39, further comprising: 
means for determining a period of time that has elapsed since a value of 

the speech dropped )elow a threshold value; and 

wherein said means for identifying an endpoint of the utterance comprises 

means for identifying {he endpoint of the utterance further based on the period of 

time. 



41 . (Original) An apparatus for performing endpoint detection comprising: 
means for inputting speech representing an utterance, the utterance 

having a time-varying fundamental frequency; 

means for determining that a value of the speech has dropped below a 

threshold value; 

means for compiling an intonation of the utterance by determining the 
fundamental frequency c f the utterance as a function of time; 

means for referencing the intonation of the utterance against an intonation 
model to determine a first end-of-utterance probability; 

means for determining a period of time that has elapsed since the speech 
dropped below the threshold value; 
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;inq 



means for referencing t 



means for referencing the period of time against an elapsed time model to 
determine a second end-of-utt trance probability; 

the duration of the final syllable of the utterance 
against a syllable duration moqel to determine a third end-of-utterance 
probability; 

means for computing an dverall end-of-utterance probability as a function 
of the first, second, and third end-of-utterance probabilities; and 

>t)u 



means for determining whetl 



comparing the overall end-of-utteramce probability to a threshold probability. 



er an end-of-utterance has occurred by 
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